Dimension of the speech space - Communications, Speech and Vision, IEE Proceedings I

نویسنده

  • M. D. Alder
چکیده

Tattersal et al. have described an attempt to use the Kohonen algorithm for locating a hypothetical two-dimensional speech space in a space of filter-bank values. The existence of this space is moot and its dimension is more moot, and the Kohonen algorithm itself does not yield any information on the intrinsic dimension of the set to which it converges. There is therefore some interest in trying to decide by other means whether or not the speech space does have an intrinsic dimension of two. This paper falls into two main sections. In the first, we define a statistic for estimating the intrinsic dimension of a finite set of points on the assumption that they lie on a smoothly embedded manifold, when, of course, the dimension is an integer. We test the method on finite sets drawn from known manifolds and show that it is robust. We also apply it to the Lorenz attractor, which is a well known set of nonintegral dimension. Finally we apply it to speech data of the same type as that used by Tattersal et al. We conclude that the speech space is not discernibly a low-dimensional manifold at all, and that a more plausible hypothesis is that the space is an open subset of the enclosing space. In the second section, we construct a measure of the extent to which the surface that the Kohonen algorithm fits to the speech space is buckled or crinkled related to the mean absolute curvature. The intention is to test (a) the hypothesis that the points of the speech space constitute a muralium, a twomanifold with noise, and (b) the hypothesis that the Kohonen process will find the muralium. We conclude that it is indeed possible to approximate the speech space with a low-dimensional manifold, but that it has dimension greater than two.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hermite polynomials for signal reconstruction from zero-crossings. 1. One-dimensional signals - Communications, Speech and Vision, IEE Proceedings I

Generalised Hermite polynomials are employed for the reconstruction of an unknown signal from a knowledge of its zero-crossings, under certain conditions on its spatial/spectral width, but dispensing with the assumption of bandlimitedness. A computational implementation of the proposed method is given for one-variable (or one-dimensional) signals, featuring an application of simulated annealing...

متن کامل

Efficient flow control in a multiclass telecommunications environment - Communications, Speech and Vision, IEE Proceedings I

Efficient (Pareto optimal) flow control in a multiclass telecommunications environment is analysed. Several classes of users compete for the resources provided by an exponential server, using power as their performance criterion. The necessary and sufficient conditions that the throughputs of the different classes have to satisfy are presented. Relations with definitions of global performance m...

متن کامل

Mental Timeline in Persian Speakers’ Co-speech Gestures based on Lakoff and Johnson’s Conceptual Metaphor Theory

One of the introduced conceptual metaphors is the metaphor of "time as space". Time as an abstract concept is conceptualized by a concrete concept like space. This conceptualization of time is also reflected in co-speech gestures. In this research, we try to find out what dimension and direction the mental timeline has in co-speech gestures and under the influence of which one of the metaphoric...

متن کامل

Multipath fading effects on carrier recovery of BPSK signal in digital radio - Communications, Speech and Vision, IEE Proceedings I

Tracking performance of the I-Q carrier recovery loop for BPSK signal in bandlimited channel with multipath interference is analysed. The multipath fading channel is characterised by the Rummler’s model (a simplified three-ray model) for line-of-sight digital radio system. A closed-form expression for the phaseerror variance related to the multipath interference, intersymbol interference and ch...

متن کامل

Fast algorithm for VQ codebook design - Communications, Speech and Vision, IEE Proceedings I

The paper presents a fast codebook training algorithm for vector quantisation. It uses an elimination rule, based on triangular inequality criteria, as well as the partial distortion elimination method, to relieve the computational burden of a conventional codebook training algorithm, including a binary codeword splitting algorithm for the initial codebook and the LBG recursive algorithm. Over ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009